|
|
Accession Number |
TCMCG075C29415 |
gbkey |
CDS |
Protein Id |
XP_007009694.2 |
Location |
join(3445147..3445209,3445359..3445446,3447720..3447922,3449013..3449357,3450451..3450555,3450750..3450877,3452272..3452448,3452581..3452662,3453195..3453488,3453588..3453785,3453891..3453992,3454229..3454480,3454563..3454781,3455716..3455886,3456877..3457062,3457519..3457986,3458213..3458713) |
Gene |
LOC18586320 |
GeneID |
18586320 |
Organism |
Theobroma cacao |
|
|
Length |
1193aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007009632.2
|
Definition |
PREDICTED: regulator of nonsense transcripts UPF2 isoform X1 [Theobroma cacao] |
CDS: ATGGATCACCATGAGGATGAATGTCGTGCTGGAGGTGAACACCATGGCAAACAAGATGATGAGGAAGCTGTTGCTCGTCTTGAGGAAATGAAGAAATCAATCGAGGGAAAAATGGCTCTTCGTCAAAGCAATTTGAATCCTGAGAGGCCTGACTCTGGGTTCCTCAGAACATTGGATTCCAGCATCAGGCGCAATACAGCTGTTATTAAAAAATTAAAGCAGATAAATGAAGAGCAGAAGGAAGGATTGATGGAGGAGCTACGAAGTGTTAATTTAAGCAAATTTGTTAGTGAAGCTGTGACTGCTATATGTGATGCCAAGCTTAAAAGTTCAGATATACAAGCTGCAGTTCAGATCTGCTCTTTGCTTAATCAAAGGTACAAAGATTTCTCACCAAGTCTGATACAAGGGCTCCTGAAAGTCTTCTTTCCGGGAAAATCTGGGGATGATTTGGATGCAGACAGGAACTTGAAGGCAATGAAGAAGCGTAGTACTTTGAAACTTCTTCTGGAACTTTACTTTGTTGGAGTTATAGAAGACAACGGAATCTTTATCAATATCATTAAGGATCTTACTAGCACAGAGCACTTGAAGGACCGAGATGCTACTCAGACAAATTTAACTCTTCTTGCTAGTTTTGCTCGACAAGGGCGAGTGTTTCTAGGGCTTCCAATTTCTGGACAAGAAATTCTAGAAGAGTTCTTTAAAGGCCTCAATATCACAGCAGATCAGAAGAAAACTTTTAGGAAGGCCTTCCATGCATATTATGATGCTGTCACTGAACTTCTTCAGTCTGAGCATGCGACACTTCGCCAGATGGAGCATGAGAATGCCAAGATTCTAAATGCTAAAGGAGAGCTCAATGAAGAAAATGCGTCTTCATATGAAAAGCTGCGAAAATCCTATGATCATTTGTACCGCAATGTCTCGTCTTTAGCAGAAGCACTTGATATGCAGTCTCCAGTGATGCCAGAGGACAGTCACACAACTAGGGTTACTACTGGAGAGGATGCTTCATCTCCTGCCACTGGAAAAGAGTCTTCCACCCTTGAAGCTATATGGGATGATGACGACACTAGAGCGTTCTATGAATGCTTACCAGATCTCAGAGCATTTGTCCCAGCAGTATTATTGGGAGAAGCTGAGCCCAAAGGGATCGAGCAAACGTCAAAGGCACAAGAGCAACCAACTGATTCCTCTACTGAAGCAGATCAAAGTACTGCAGTTGCCCAAGATGCTGTGGAGGCTTCTGCAGACTCTGGCAATTTGCAAGAAGGGAAAAGTATAGAGAAAGGAAAGGACAAAGAAGAAAAGGACAAAGAAAGGAATAAAGATCCAGACAAGGAGAAAGGGAAAGAAAAAGACTCCGATAAAAAAGGAGAGAATGAAAAGGAGAAGCTTAAAGGTCTTGAAGGAACGAATCTGGATGCTCTACTGCAAAGACTCCCAGGTTGTGTGAGCCGTGACCTCATTGATCAACTTACGGTGGAGTTCTGTTATTTGAATTCAAAATCAAATCGAAAAAGGCTTGTGAGAGCATTGTTTAATGTCCCAAGGACATCTTTGGAATTGCTGCCATACTACTCCCGCATGGTTGCAACATTGTCAACTTGTATGAAGGATGTTCCCTCTATGCTCTTGCAGATGTTGGAGGAAGAGTTCAACTTCTTAATTAATAAAAAGGATCAAATGAACATTGAAACAAAGATCAGGAATATAAGGTTTATTGGAGAACTTTGCAAGTTCAGGATTGCACCAGCTGGCCTTGTTTTCAGTTGTCTGAAGACATGTTTAGATGATTTCACTCATCATAACATTGATGTCGCTTGCAATCTTCTTGAGACATGTGGTCGTTTTCTATATCGTTCTCCTGAAACTACCATAAGAATGGCTAACATGTTGGAGATCTTGATGCGCTTGAAAAATGTAAAAAATTTGGATCCTCGACACAGCACACTTGTAGAAAATGCTTACTACCTGTGCAAGCCACCTGAAAGATCTGCACGAGTCTCTAAAGTCCGTCCACCATTGCACCAGTATATTAGAAAATTGCTATTTACAGATCTTGATAAGTCTTCCATTGAGCATGTGCTGAGGCAACTTCGTAAATTACCATGGAGTGAATGTGAATCATACCTCTTAAAGTGCTTCATGAAGGTTCACAAAGGGAAATATGGTCAGATTCACTTGATTGCTTCTCTCACTGCTGGTTTGAGTCGCTACCATGATGAATTTGCTGTTGCTGTTGTTGATGAGGTTTTGGAGGAGATTAGGCTTGGTCTGGAATTGAATGATTATGGGATGCAGCAGAGACGCATTGCTCATATGCGTTTTCTAGGGGAGCTATACAACTATGAGCATGTTGATTCTTCTGTCATCTTTGAGACACTCTATTTGATTCTTGTTTCTGGCCATGATACAGCAGAGCAAGATGTCCTCGATCCACCTGAGGATTGTTTTCGAATCAGGATGGTTATTACTCTTCTTCAGACATGTGGGCACTACTTTGACCGAGGTTCTTCCAAGAGAAAACTTGATAGATTCTTGATACACTTCCAGAGATATATTCTTAGCAAAGGTGCCTTACCACTGGATATTGAATTTGACTTGCAGGACTTATTTGCAGAATTACGTCCCAATATGACCCGGTATTCATCCATGGAAGAAGTTAATGCTGCTTTAGTAGAACTTGAGGAACACGAACGCACTGCTTCAACTGACAAAACAAGTAGTGAGAAGCACTCTGATACTGAAAAGCCTTCTAGCAGGACAACTGCCCATTCCATCTCAGGTGATCGACCAAGCATTTTTAATGGTTCTGAGGAAAATGGTGGAGTGCATGAGGAAACCGGTGACAGTGATAGTGAATCAGGGAGTGGCACCATTGAGCCAGAGGGTCATGATGAAGATTATTTAGATGAAGAGAATCCTGATGATGGATGTGATACTGATGAGGAGGATGAGGATGATGGTGGGCCTGCTTCTGACGAGGATGATGAAGTTCATGTCAGGCAGAAGGTAGCAGAGTTGGATCCTCAAGAAGTAGCCAATTTTGACCAGGAACTCAGGGCTGTAGTGCAGGAGAGTATGGAGCAGCGCAAGCTGGAGCTTCGTGGCCGACCTACACTAAATATGATGATACCAATGAATGTGTTCGAGGGTTCTACCAAAGATCATCATGGAAGGGTAGTTGGAGGGGAAAGTGGTGATGAAGCATTGGATGAAGAGGCTGGAGGAAGCAGAGAGGTTCAGGTGAAAGTTCTTGTGAAGCGAGGGAACAAACAACAGACGAAGCAAATGTATATTCCTCGTGATTGTACTCTTGTCCAGAGCACAAAACAGAAAGAAGCAGCCGAGTTTGAAGAGAAACAAGATATCAAGAGGCTGGTCTTGGAGTATAATGACAGGGTAGAGGAGGAGAATAATGGACTCGGAACGCAGACATTGAATTGGCCAAGTGGGAATAGCAGAGTTTACGGCCGTGGAAACTCTTGGGAAGGATCCAGCGGGAGGAGTGGTGGACCACGTCATCGGCATCATAGCCATTCAGGAAGCGGAGCTTTTTACGGCAGAAAAAAGTGA |
Protein: MDHHEDECRAGGEHHGKQDDEEAVARLEEMKKSIEGKMALRQSNLNPERPDSGFLRTLDSSIRRNTAVIKKLKQINEEQKEGLMEELRSVNLSKFVSEAVTAICDAKLKSSDIQAAVQICSLLNQRYKDFSPSLIQGLLKVFFPGKSGDDLDADRNLKAMKKRSTLKLLLELYFVGVIEDNGIFINIIKDLTSTEHLKDRDATQTNLTLLASFARQGRVFLGLPISGQEILEEFFKGLNITADQKKTFRKAFHAYYDAVTELLQSEHATLRQMEHENAKILNAKGELNEENASSYEKLRKSYDHLYRNVSSLAEALDMQSPVMPEDSHTTRVTTGEDASSPATGKESSTLEAIWDDDDTRAFYECLPDLRAFVPAVLLGEAEPKGIEQTSKAQEQPTDSSTEADQSTAVAQDAVEASADSGNLQEGKSIEKGKDKEEKDKERNKDPDKEKGKEKDSDKKGENEKEKLKGLEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKSNRKRLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVPSMLLQMLEEEFNFLINKKDQMNIETKIRNIRFIGELCKFRIAPAGLVFSCLKTCLDDFTHHNIDVACNLLETCGRFLYRSPETTIRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFTDLDKSSIEHVLRQLRKLPWSECESYLLKCFMKVHKGKYGQIHLIASLTAGLSRYHDEFAVAVVDEVLEEIRLGLELNDYGMQQRRIAHMRFLGELYNYEHVDSSVIFETLYLILVSGHDTAEQDVLDPPEDCFRIRMVITLLQTCGHYFDRGSSKRKLDRFLIHFQRYILSKGALPLDIEFDLQDLFAELRPNMTRYSSMEEVNAALVELEEHERTASTDKTSSEKHSDTEKPSSRTTAHSISGDRPSIFNGSEENGGVHEETGDSDSESGSGTIEPEGHDEDYLDEENPDDGCDTDEEDEDDGGPASDEDDEVHVRQKVAELDPQEVANFDQELRAVVQESMEQRKLELRGRPTLNMMIPMNVFEGSTKDHHGRVVGGESGDEALDEEAGGSREVQVKVLVKRGNKQQTKQMYIPRDCTLVQSTKQKEAAEFEEKQDIKRLVLEYNDRVEEENNGLGTQTLNWPSGNSRVYGRGNSWEGSSGRSGGPRHRHHSHSGSGAFYGRKK |